|
|
Accession Number |
TCMCG075C24746 |
gbkey |
CDS |
Protein Id |
XP_017981047.1 |
Location |
join(16825695..16826081,16826629..16827060,16828997..16829555,16831109..16831404) |
Gene |
LOC18592971 |
GeneID |
18592971 |
Organism |
Theobroma cacao |
|
|
Length |
557aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018125558.1
|
Definition |
PREDICTED: probable alkaline/neutral invertase D isoform X1 [Theobroma cacao] |
CDS: ATGGATGGGACTAAAGAGATGGGACTTAGAAATGTGAGCTCAACCTGCTCAATTTCCGAAATGGATGATTATGATCTGTCACGCCTTCTTAACAAGCCAAAGCTTAACATAGAGAGGCAAAGATCATTTGATGAGAGGTCACTAAGTGAGCTCTCTATTGGTCTCACTAGAGGAAGCTATGACAATTATGAGACCACCCACTCGCCTGGTGGGAGGTCAGGTTTTGATACTCCGGCTTCATCAGCTAGAAATTCCTTTGAACCTCACCCCATGGTGGCTGAAGCATGGGAAGCTCTCAGGAGATCATTGGTGTATTTCAGAGGCCAACCCGTTGGTACCATTGCCGCATATGATCATGCTTCTGAGGAAGTTTTGAACTATGATCAGGTTTTTGTTCGAGATTTTGTACCCAGTGCTCTGGCTTTTCTGATGAATGGAGAGCCTGAGATAGTTAAGAACTTCCTCTTGAAGACCCTACAACTTCAAGGGTGGGAGAAAAGAATAGATAGATTCAAGCTAGGGGAAGGTGCAATGCCAGCTAGCTTCAAAGTGCTTCATGATCCTGTACGTAAAACAGACACAATTATTGCAGATTTTGGAGAGAGTGCCATTGGACGAGTTGCTCCAGTTGACTCTGGATTTTGGTGGATAATTCTGCTCCGTGCATATACAAAATCTACCGGGGATTTATCTCTTGCGGAGACACCTGAGTGTCAAAAAGGAATGAGGCTCATACTTACTCTGTGTCTATCAGAAGGATTTGATACATTCCCAACCCTACTTTGTGCTGATGGATGCTCTATGATTGATCGAAGAATGGGTATTTATGGTTATCCTATTGAAATTCAAGCACTTTTCTTTATGGCGTTGAGGTGTGCTTTATCAATGCTGAAGCATGATGCAGAAGGAAAAGAGTGCATTGAAAGAATTGTAAAGCGTTTGCATGCCTTGAGTTATCACATGCGCAGTTACTTTTGGCTTGACTTTCAACAACTAAATGATATTTACAGATATAAAACTGAGGAATATTCTCACACAGCAGTAAATAAGTTTAATGTTATTCCTGATTCAATTCCTGACTGGGTATTTGATTTTATGCCAACACGAGGTGGCTACTTTATTGGCAATGTTAGTCCTGCAAGGATGGATTTCCGATGGTTTTGTTTAGGTAACTGTATAGCAATCCTATCTTCTCTTGCAACTCCAGAGCAATCAATGGCTATAATGGACCTTATTGAAGCCCGTTGGGATGAGCTTGTTGGAGAAATGCCTTTAAAAATAGCTTATCCTGCAATAGAAAGTCATGACTGGCGAATTGTCACTGGTTGTGACCCTAAGAACACGAGATGGAGTTATCACAATGGAGGATCCTGGCCAGTGCTTTTGTGGTTGCTAACTGCTGCTTGCATCAAGACGGGAAGACCACAAATTGCAAGACGAGCTATTGATCTTGCTGAGACACGTTTGCTGAAAGATAGCTGGCCAGAATATTATGATGGCACACTTGGGAGATTTATTGGTAAACAGGCTCGGAAGTATCAGACATGGTCAATAGCAGGATATTTAGTGGCAAAAATGATGCTAGAGGATCCGTCTCACTTGGGGATGATTTCTCTGGAAGAGGACAAGCAGATGAAGCCATTGATAAAGAGATCATCTTCTTGGAATTGCTAA |
Protein: MDGTKEMGLRNVSSTCSISEMDDYDLSRLLNKPKLNIERQRSFDERSLSELSIGLTRGSYDNYETTHSPGGRSGFDTPASSARNSFEPHPMVAEAWEALRRSLVYFRGQPVGTIAAYDHASEEVLNYDQVFVRDFVPSALAFLMNGEPEIVKNFLLKTLQLQGWEKRIDRFKLGEGAMPASFKVLHDPVRKTDTIIADFGESAIGRVAPVDSGFWWIILLRAYTKSTGDLSLAETPECQKGMRLILTLCLSEGFDTFPTLLCADGCSMIDRRMGIYGYPIEIQALFFMALRCALSMLKHDAEGKECIERIVKRLHALSYHMRSYFWLDFQQLNDIYRYKTEEYSHTAVNKFNVIPDSIPDWVFDFMPTRGGYFIGNVSPARMDFRWFCLGNCIAILSSLATPEQSMAIMDLIEARWDELVGEMPLKIAYPAIESHDWRIVTGCDPKNTRWSYHNGGSWPVLLWLLTAACIKTGRPQIARRAIDLAETRLLKDSWPEYYDGTLGRFIGKQARKYQTWSIAGYLVAKMMLEDPSHLGMISLEEDKQMKPLIKRSSSWNC |